Ontology-driven discourse analysis for information extraction
نویسندگان
چکیده
This paper presents a novel approach to discourse analysis within information extraction systems. It makes use of DRT as formal representation of the linguistic context as well as of a domain-specific ontology as a basis to compute conceptual relations between extracted events thus establishing discourse coherence. The approach has been implemented within GenIE, an information extraction system with the aim of extracting information about biochemical pathways, about sequences, structures and functions of genomes and proteins. The approach is evaluated against a semantically hand-annotated set of Swiss-Prot protein function descriptions and shows very promising results. 2004 Elsevier B.V. All rights reserved.
منابع مشابه
Ontology-Driven Discourse Analysis in GenIE
This paper presents a novel approach to discourse analysis within information extraction systems. It makes use of DRT as formal representation of the linguistic context as well as of a domain-specific ontology as a basis to compute conceptual relations between extracted events thus establishing discourse coherence. The approach has been implemented within GenIE, an information extraction system...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملA protocol for constructing a domain-specific ontology for use in biomedical information extraction using lexical-chaining analysis
In order to do more semantics-based information extraction, we require specialized domain models. We develop a hybrid approach for constructing such a domain-specific ontology, which integrates key concepts from the protein-protein– interaction domain with the Gene Ontology. In addition, we present a method for using the domain-specific ontology in a discourse-based analysis module for analyzin...
متن کاملAutomatic Annotation of Discourse and Semantic Relations Supplemented by Terminology Extraction for Domain Ontology Building and Information Retrieval
In this article, we develop a framework for the building of domain ontologies and a semantic index based on two technologies: terminology extraction with LEXTER (© EDF R&D) and discourse and semantic annotation with EXCOM. We have selected two specific points of view for this study: causality and part-whole notions. In the first part of this paper, we explain the contributions of a terminology ...
متن کاملOntology-Driven Information Systems: Challenges and Requirements
The increased use of ontologies in several application fields makes it possible to observe requirements for their smooth integration within Information Systems. In this paper we analyse these requirements and propose the usage of additional semantic knowledge in the ontology to reconcile them. We think that these properties are essential to enhance the performance of ontology-driven Information...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Data Knowl. Eng.
دوره 55 شماره
صفحات -
تاریخ انتشار 2005